ABSTRACT

The highly conserved Paf1 complex (PAF1C) plays
critical roles in RNA polymerase II transcription
elongation and in the regulation of histone modifications.
It has also been implicated in other diverse
cellular activities, including posttranscriptional
events, embryonic development, cell survival
and maintenance of embryonic stem cell identity.
Here, we report the structure of the human Paf1/Leo1
subcomplex within PAF1C. The overall structure
reveals that the Paf1 and Leo1 subunits form a
tightly associated heterodimer through antiparallel
beta-sheet interactions. Detailed biochemical
experiments indicate that Leo1 binds to PAF1C
through Paf1 and that the Ctr9 subunit is the
key scaffold protein in assembling PAF1C.
Furthermore, we show that Paf1/Leo1 heterodimer
formation is necessary for binding to histone
H3, the histone octamer and the nucleosome in vitro.
Our results shed light on the PAF1C assembly
process and on substrate recognition during various
PAF1C-coordinated histone modifications.

INTRODUCTION 

In eukaryotes, most DNA is wrapped around a histone
octamer to form a nucleosome. Transcription elongation,
mediated by RNA polymerase II (Pol II), occurs in the
context of nucleosomal arrays within the chromatin;
however, nucleosomes may become an obstacle for Pol II
passage. During elongation, various specific Pol II-associated
factors may facilitate transcriptional elongation
and modulate the elongation complex at multiple stages
(1). The Paf1 complex (PAF1C) was originally identified
in Saccharomyces cerevisiae as an RNA polymerase II-associated
factor (PAF) (2) and contributes to multiple
aspects of Pol II transcriptional regulation (3,4).

PAF1C is a multifunctional complex comprising the
Paf1, Leo1, Ctr9, Cdc73 and Rtf1 subunits in budding
yeast (5,6). Human PAF1C contains an additional
subunit, Ski8/Wdr61 (7). PAF1C is associated with the
promoter and coding sequences of active genes (8,9) and
is recruited to chromatin through both Cdc73 and Rtf1
(10-12). PAF1C has been implicated in transcription
elongation through physical and functional interactions
with various elongation factors, including Spt16-Pob3/FACT,
Spt4-Spt5/DSIF and TFIIS (7,13,14).

Beyond its role in transcription elongation, PAF1C
has established roles in H2B monoubiquitylation and
downstream histone H3 methylation. The recruitment of
the Bre1/Rad6 ubiquitin conjugase-ligase complex by
PAF1C is essential for H2B monoubiquitylation (15,16),
which is a prerequisite for the subsequent methylation
of H3K4 and H3K79 by the Set1/COMPASS and Dot1
methyltransferases, respectively (15,17-20). Moreover,
various studies have implicated PAF1C function in other
diverse cellular processes, including the processing of
mRNA and small nucleolar RNA during posttranscriptional
events (12,21-23), mammalian embryonic
development and survival in adults (24,25) and embryonic
stem cell identity maintenance (26). Additional evidence
implicates PAF1C in immunological responses to disease
states and cancer progression (27,28).

To date, structural information on PAF1C is limited.
Thus far, the structures of the Plus3 domain of Rtf1 (29)
and the Ras-like C-domain of Cdc73 (30,31) have been
determined. No structural data on PAF1C formation have
been reported. In this study, we determine the crystal
structure of the human Paf1/Leo1 subcomplex to elucidate
the assembly of PAF1C and the mechanisms underlying its
function.

MATERIALS AND METHODS

Expression and purification 

To co-express Paf1 and Leo1, DNA fragments corresponding
to human Paf1 (full-length residues 1-531) and
Leo1 (full-length residues 1-666) were polymerase chain
reaction (PCR)-amplified and cloned into an in-house
modified version of the pET32a vector (Novagen). Ribosome
binding sequence (RBS)-Leo1, which contains an
RBS on the N-terminus of Leo1, was PCR-amplified
from pET32a-Leo1 and inserted into pET32a-Paf1. The
resulting proteins contained a Thioredoxin (Trx)-His6 tag
on the N-terminus of both Paf1 and Leo1.

The recombinant proteins were expressed in BL21(DE3)
Codon Plus Escherichia coli cells at 16°C for 16-18 h. The
cells were then lysed using an AH-1500 (ATS Engineering
Limited). The Trx-His6-tagged protein complex was
purified by Ni-NTA affinity chromatography (QIAGEN)
followed by size-exclusion chromatography on a HiLoad
26/60 Superdex 200 column (GE Healthcare) in 50 mM Tris,
pH 8.0, 100 mM NaCl, 1 mM EDTA and 1 mM dithiothreitol
(DTT). The Trx-His6-tagged complex was
further purified on a HiPrep Q FF 16/10 anion-exchange
column (GE Healthcare). The final purification step was
size-exclusion chromatography on a HiLoad 26/60
Superdex 200 column in 50 mM Tris, pH 8.0, 200 mM
NaCl, 1 mM EDTA and 1 mM DTT. The sodium
dodecyl sulphate-polyacrylamide gel electrophoresis
(SDS-PAGE) analysis of the recombinant Trx-Paf1/Trx-Leo1
complex is shown in Supplementary Figure S1A.

We used similar co-expression strategies to prepare
various truncations of the Paf1/Leo1 complex, including
residues 57-392 of Paf1 and residues 277-562 of
Leo1 (referred to as Paf1(57-392)/Leo1(277-562); similar
nomenclature hereafter), Paf1(161-392)/Leo1(277-562),
Paf1(161-250)/Leo1(277-562), Paf1(161-250)/Leo1(370-462)
and additional intermediate truncations. All expression
and purification processes were similar to the procedures
for the full-length complex. The SDS-PAGE analyses of all
recombinant protein complexes are shown in
Supplementary Figures S1 and S2.

To make a single-chain fusion protein of the
Paf1(161-250)/Leo1(370-462) complex (hereafter named
Paf1(161-250)-Leo1(370-462)), DNA fragments corresponding
to the Paf1 residues 161-250 (Paf1(161-250)) and Leo1
residues 370-462 (Leo1(370-462)) were amplified by PCR
and linked with a tobacco etch virus (TEV) protease-cleavable
segment (Glu-Asn-Leu-Tyr-Phe-Gln-Ser). Two
amino acids (Ser-Gly) were inserted before the TEV
protease-cleavable segment. The single open reading
frame was cloned into an in-house modified version of
the pET32a vector. The resulting protein contained a
Trx-His6 tag on its N-terminus.
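
As a rough sanity check on the construct described above, the length and approximate mass of the tag-free single-chain fusion can be tallied from its segments. The sketch below is illustrative only: the segment order (Paf1, Ser-Gly, TEV site, Leo1) is inferred from the description, the 110 Da average residue mass is a common rule of thumb rather than a value from this study, and any residues left behind after tag removal are ignored.

```python
# Back-of-envelope tally of the single-chain fusion (tag removed).
# Assumptions (not from the paper): segment order as listed below and an
# average residue mass of 110 Da; residues remaining after tag cleavage
# are not counted.
AVG_RESIDUE_MASS_DA = 110.0

segments = [
    ("Paf1(161-250)", 250 - 161 + 1),   # 90 residues
    ("Ser-Gly spacer", 2),
    ("TEV site ENLYFQS", len("ENLYFQS")),
    ("Leo1(370-462)", 462 - 370 + 1),   # 93 residues
]


def fusion_length(parts):
    """Total number of residues across all segments."""
    return sum(n for _, n in parts)


def approx_mass_kda(n_residues, avg=AVG_RESIDUE_MASS_DA):
    """Approximate mass in kDa from a residue count."""
    return n_residues * avg / 1000.0


total = fusion_length(segments)
print(f"{total} residues, ~{approx_mass_kda(total):.1f} kDa")
```

Under these assumptions the estimate is of the same order as the ~25 kDa obtained for the fusion protein by sedimentation velocity.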

The recombinant protein was expressed in BL21(DE3)
Codon Plus E. coli cells at 16°C for 16-18 h. The
Trx-His6-tagged protein was purified by Ni-NTA affinity
chromatography, followed by size-exclusion chromatography on
a HiLoad 26/60 Superdex 200 column in 50 mM Tris, pH
8.0, 100 mM NaCl, 1 mM EDTA and 1 mM DTT. After
digestion with PreScission Protease to cleave the
N-terminal Trx-His6 tag, the target protein was purified
on a HiPrep Q FF 16/10 anion-exchange column. The
final purification step was size-exclusion chromatography
on a HiLoad 26/60 Superdex 200 column in
20 mM N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic
acid (HEPES), pH 7.5, 50 mM NaCl, 1 mM EDTA and
1 mM DTT.

Crystallization and data collection 

Wild-type Paf1(161-250)-Leo1(370-462) crystals were grown
at 20°C using the sitting-drop vapor-diffusion method.
The reservoir solution contained 3.5 M sodium formate,
pH 7.0. Initial crystals were needle-shaped and unsuitable
for diffraction. We screened several additives and
found that AdoMet or cyclic AMP could
significantly improve crystal formation. Finally, sitting
drops of 3 µl total volume were prepared from 1 µl
Paf1(161-250)-Leo1(370-462) protein (30 mg/ml in 20 mM
HEPES, pH 7.5, 50 mM NaCl, 1 mM DTT and 1 mM
EDTA) mixed with 1 µl of reservoir solution and 1 µl
AdoMet (20 mg/ml). Tetragonal crystals appeared within
1 week. Diffraction data were collected on beam line
BL17U1 of the Shanghai Synchrotron Radiation Facility
(SSRF). The data were processed and scaled to 2.5 Å using
the HKL2000 software package (32). The
Paf1(161-250)-Leo1(370-462) crystal belongs to space group
R32 with unit cell dimensions a = b = 115.2 Å, c = 157.4 Å.
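
The reported cell can be used for a quick estimate of crystal packing. The sketch below computes the unit cell volume, the Matthews coefficient and an approximate solvent fraction. It rests on several assumptions that are not stated in this section: that the cell is given in the hexagonal setting of R32 (gamma = 120°, 18 asymmetric units per cell), that there is one fusion molecule per asymmetric unit, and that the molecule weighs roughly 25 kDa. The 1.23 factor is Matthews' usual approximation for protein crystals.

```python
import math

# Cell dimensions from the text (Å).
A_ANGSTROM = 115.2
C_ANGSTROM = 157.4

# Assumptions for illustration only:
ASU_PER_CELL = 18          # R32 in the hexagonal setting
MOLECULES_PER_ASU = 1      # assumed one fusion molecule per asymmetric unit
MASS_DA = 25_000.0         # assumed approximate mass of the fusion protein


def hexagonal_cell_volume(a, c):
    """Volume (Å^3) of a cell with a = b, alpha = beta = 90°, gamma = 120°."""
    return a * a * c * math.sin(math.radians(120.0))


def matthews_coefficient(volume, n_molecules, mass_da):
    """Crystal volume per unit of protein mass (Å^3/Da)."""
    return volume / (n_molecules * mass_da)


def solvent_fraction(vm):
    """Approximate solvent fraction from the Matthews coefficient."""
    return 1.0 - 1.23 / vm


volume = hexagonal_cell_volume(A_ANGSTROM, C_ANGSTROM)
vm = matthews_coefficient(volume, ASU_PER_CELL * MOLECULES_PER_ASU, MASS_DA)
print(f"V = {volume:.0f} Å^3, Vm = {vm:.2f} Å^3/Da, "
      f"solvent ~ {solvent_fraction(vm):.0%}")
```

Changing `MOLECULES_PER_ASU` or `MASS_DA` shows how sensitive the estimate is to these assumed inputs.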

Se-Met-substituted protein was expressed in B834 cells
using LeMaster medium. The purification and crystallization
conditions were the same as those described above,
except the reservoir solution was supplemented with 4%
1,4-butanediol. The single anomalous dispersion (SAD)
data set for the Se-Met-substituted
Paf1(161-250)-Leo1(370-462) was collected at the peak
wavelength for Se on beam line BL17U1 of the SSRF. The
anomalous signal data set was processed and scaled to
2.8 Å using the HKL2000 software package (32).

Structure determination and refinement 

The HKL2MAP program (33) located eight of the 10
theoretical Se sites. Initial SAD phases were then calculated
using PHENIX software (34). The model was built
manually using the COOT program (35). After the
initial alanine model was built, iterative refinement was
performed against the wild-type data to assign all side
chains. After several refinement cycles with the PHENIX
program (34) and COOT (35), the orientations of amino
acid side chains and bound water molecules were modeled
on the basis of 2Fo-Fc and Fo-Fc difference Fourier
maps. The final structure had an Rcryst value of 23.2%
and an Rfree value of 26.8%. The Ramachandran plot
calculated by the PROCHECK program (36) showed
that 93.5% of the residues were in their most favored
regions, 6.5% of the residues were in additionally
allowed regions and no residues were in generously
allowed regions or disallowed regions. Detailed data
collection and refinement statistics are summarized in
Supplementary Table S1.
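
For readers less familiar with the refinement statistics quoted above, the conventional crystallographic R factor is written out below. This is the standard textbook definition, given here as an assumption about what Rcryst and Rfree denote; the exact definitions used for this structure would be those accompanying the supplementary statistics.

```latex
R \;=\; \frac{\sum_{hkl} \bigl|\, |F_{o}(hkl)| - |F_{c}(hkl)| \,\bigr|}
             {\sum_{hkl} |F_{o}(hkl)|}
```

Here F_o and F_c are the observed and calculated structure factor amplitudes. Rcryst is conventionally evaluated over the reflections used in refinement, and Rfree over a small test set excluded from refinement. With the reported values, the difference Rfree - Rcryst is 26.8% - 23.2% = 3.6 percentage points.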

Glutathione S-transferase pull-down assays

H3(1-28)-glutathione S-transferase (GST) and
H3(1-28)(K4A, R17A)-GST fusion proteins were expressed
in E. coli BL21(DE3) Codon Plus cells and purified using a
glutathione-Sepharose 4B column (GE Healthcare) and a
HiLoad 26/60 Superdex 200 column in 50 mM Tris, pH
8.0, 100 mM NaCl, 1 mM EDTA and 1 mM DTT.
Transfected HEK293T cells were lysed using 1 ml of ice-cold
cell lysis buffer [50 mM Tris-HCl, pH 7.4, 150 mM
NaCl, 3% glycerol, 0.5% NP40, 0.5% Triton X-100 and
1 mM phenylmethylsulphonyl fluoride (PMSF)] and
cleared by centrifugation at 13 000 rpm for 30 min at
4°C. Soluble fractions were incubated with GST-fusion
proteins coupled to glutathione-Sepharose 4B beads at
4°C for 2 h. The beads were washed with cell lysis buffer
three times and boiled in SDS sample buffer. The prepared
samples were then separated by SDS-PAGE and analyzed
by western blotting.

GST-Paf1/Trx-Leo1 fusion proteins were expressed in
E. coli BL21(DE3) Codon Plus cells and purified using a
glutathione-Sepharose 4B column (GE Healthcare) and a
HiLoad 26/60 Superdex 200 column in 50 mM Tris, pH
8.0, 100 mM NaCl, 1 mM EDTA and 1 mM DTT. Human
histone octamer was prepared according to a modified
method (37). GST fusion proteins were incubated with
glutathione-Sepharose 4B beads at 4°C for 2 h. The
beads were washed three times with buffer (50 mM
Tris-HCl, pH 8.0, 100 mM NaCl and 5% glycerol).
Histone octamer was then incubated with the GST fusion
proteins coupled to glutathione-Sepharose 4B beads at
4°C for 2 h. The beads were washed three times with the
same buffer (50 mM Tris-HCl, pH 8.0, 100 mM NaCl and 5%
glycerol). The prepared samples were then separated by
SDS-PAGE and the gel was stained with Coomassie blue.

Co-Immunoprecipitation 

HEK293T cells were transfected with the indicated
combinations of plasmids. At 24 h after transfection,
HEK293T cells were lysed using 1 ml of ice-cold cell
lysis buffer (50 mM Tris-HCl, pH 7.4, 150 mM NaCl,
3% glycerol, 0.5% NP40, 0.5% Triton X-100 and 1 mM
PMSF) and cleared by centrifugation at 13 000 rpm for
30 min at 4°C. The supernatants were then incubated
with agarose-conjugated anti-Myc (clone PL14, Medical
& Biological Laboratories) for 2 h at 4°C. The agarose
beads were washed three times with cell lysis buffer and
eluted with SDS sample buffer. Samples were then subjected
to SDS-PAGE and western blot analysis.

Dot-blot overlay assay 

Recombinant human histone octamer and nucleosome
were expressed, purified and assembled with a 147-bp
DNA sequence as described previously [Supplementary
Figure S6B; (37)]. The purified Paf1/Leo1,
Paf1/Leo1(370-666), Paf1(1-375)/Leo1 and
Paf1(1-375)/Leo1(370-666) complex proteins were stored in
buffer (50 mM Tris, pH 8.0, 100 mM NaCl, 1 mM EDTA and
1 mM DTT). Each recombinant protein was first
resolved on a 12% SDS-PAGE gel to normalize the
sample inputs before the proteins were absorbed onto
polyvinylidene difluoride (PVDF) membranes (Millipore).
The membranes were subsequently blocked with 10%
nonfat milk in TBST (50 mM Tris-HCl, pH 7.4, 150 mM
NaCl and 0.1% Tween 20) for 1 h. The membranes were
then incubated with 10 µg/ml histone octamer or nucleosome
in TBST at room temperature for 1 h. After extensive
washing, the membrane-bound histone octamer or nucleosome
core particle was immunoblotted using anti-H3
antibody. Finally, the membrane was stained using
Coomassie blue to confirm the sample loading.

Analytical gel filtration 

Size-exclusion chromatography was performed on an
AKTA FPLC system using a Superose 12 10/300 column
(GE Healthcare) for the single-chain fusion protein
Paf1(161-250)-Leo1(370-462) and a Superdex 200 10/300
column (GE Healthcare) for the other Paf1/Leo1 complex
proteins shown in Supplementary Figures S1 and S2. The
column was equilibrated in buffer containing 50 mM Tris,
pH 8.0, 200 mM NaCl, 1 mM EDTA and 1 mM DTT.

Analytical ultracentrifugation 

Sedimentation velocity (SV) experiments were performed
in a Beckman Coulter XL-I analytical ultracentrifuge
(Beckman Coulter) using double-sector centerpieces and
sapphire windows. SV experiments were conducted at
4°C using interference detection, at 42 000 rpm for
Paf1(161-250)-Leo1(370-462), 40 000 rpm for
Paf1(57-392)/Leo1(277-562) and 36 000 rpm for
Trx-Paf1/Trx-Leo1. The SV data were analyzed using the
SEDFIT program (38,39).



RESULTS 

Interaction between Paf1 and Leo1

Previous studies have shown that in human PAF1C, Paf1
and Leo1 form a binary subcomplex (7,19). We first confirmed
the Paf1/Leo1 interaction by showing that two
Thioredoxin (Trx)-tagged fusion proteins (Trx-Paf1 and
Trx-Leo1) form a heteromeric complex with a stoichiometry
of 1:1 on an analytical size-exclusion column
(Supplementary Figure S1A). Analytical ultracentrifugation
further confirmed that the Paf1/Leo1 subcomplex
assembles into a heterodimer with a molecular mass of
~172 kDa (Supplementary Figure S1D). The Paf1/Leo1
subcomplex protein, however, is unstable (SDS-PAGE
inset in Supplementary Figure S1A). Using a combination
of limited proteolysis followed by size exclusion
and mass spectrometry fingerprinting, we identified a region
in Paf1 (aa 57-392, Paf1(57-392)) and a region in Leo1
(aa 277-562, Leo1(277-562)) that also form a 1:1 stoichiometric
heterodimer complex (Supplementary Figures S1B
and D). Using a similar truncation-based approach, we
mapped the minimal Leo1-binding region of Paf1 to a
90-residue fragment (aa 161-250, Paf1(161-250)) and the
minimal Paf1-binding region of Leo1 to a 93-residue
fragment (aa 370-462, Leo1(370-462)) (Figure 1A and B,
and Supplementary Figure S2I). Given these results, we
conclude that the Paf1/Leo1 subcomplex forms a
heterodimer through interaction between fragments
Paf1(161-250) and Leo1(370-462).

Overall structure of the Paf1/Leo1 heterodimer

To understand how Paf1 and Leo1 bind to each other,
we attempted to determine the crystal structure of the
Paf1(161-250)/Leo1(370-462) heterodimer using two
separate chains; these experiments were unsuccessful.
However, we succeeded in obtaining crystals of the single
polypeptide created by the fusion of Paf1(161-250) to the
N-terminus of Leo1(370-462) with a TEV-cleavable segment.
The purified single-chain fusion protein of Paf1(161-250)
and Leo1(370-462) (herein named Paf1(161-250)-Leo1(370-462))
eluted as a single peak from an analytical size-exclusion
column and assembled into a heterodimer with a molecular
mass of ~25 kDa from the SV experiment (Supplementary
Figure S1C and D). The crystal structure of
Paf1(161-250)-Leo1(370-462) was determined at a resolution
of 2.5 Å and with one molecule per asymmetric unit
(Supplementary Table S1). S-adenosyl-L-methionine (AdoMet)
was added into the crystallization solution to facilitate the
formation of the crystal complex. In the final model, all
residues are clearly visible, except residues 399-406 of
Leo1(370-462), presumably owing to conformational flexibility
(Figure 1C and D). The entire complex is butterfly
shaped, with the Leo1 segment encompassed by the Paf1
segment. The Paf1(161-250) segment consists of two α-helices
(Pα1-Pα2) and three β-strands (Pβ1-Pβ3), while the
Leo1(370-462) segment consists of one α-helix (Lα1) and
six β-strands (Lβ1-Lβ6).

Part of an AdoMet molecule is visible and modeled
based on the 2Fo-Fc electron density map (Figure 1C
and D). A detailed structural analysis shows that one
AdoMet molecule has several interactions with two
symmetry-related molecules of the Paf1(161-250)/Leo1(370-462)
heterodimer in the crystal (Supplementary
Figure S3A). The adenine moiety of AdoMet is stabilized
by the hydrophobic pocket formed by Paf1 in one
molecule, which consists of residues V204, P206 and
I190 (Supplementary Figure S3B and C). Moreover, the
N6 amino group of the adenine moiety forms a hydrogen
bond with the main-chain carboxyl group of K188 and
A186 in the same molecule, while the N3 and N9 amines
form hydrogen bonds with the main-chain carboxyl group
of E428 of Leo1 in the other symmetry-related molecule,
and the O2 of the ribose moiety forms a hydrogen bond
with the side-chain carboxyl group OE1 of E428.

Figure 1. Crystal structure of the human Paf1/Leo1 subcomplex. (A and B) Gel filtration-based test of the binding of Paf1 and Leo1 by the
co-expression of target proteins. '+' indicates protein binding, according to the supporting data in Supplementary Figures S1 and S2; '-' indicates
no protein binding. The protein fragments of the complex used for structural determination were Paf1(161-250), colored in cyan, and Leo1(370-462),
colored in magenta. (C) A cartoon representation of the structure of the Paf1(161-250)-Leo1(370-462) heterodimer complex. Paf1 is shown in cyan, and
Leo1 is shown in magenta. The bound AdoMet is shown as a stick model and colored in yellow. (D) A side view of the Paf1(161-250)-Leo1(370-462)
heterodimer complex. The N- and C-termini of the two proteins are labeled. For clarity, the β-strand formed by the artificial linker is not shown.

Paf1 and Leo1 interactions in the heterodimer

In the structure of the Paf1(161-250)-Leo1(370-462)
heterodimer, Pβ1, Lβ4, Lβ5 and Lβ6 form a set of antiparallel
β-sheets, which are the main interactions between
Paf1 and Leo1. The Pβ1 strand from Paf1 lies antiparallel
to the Lβ4 strand from Leo1 to form multiple main-chain
hydrogen bonds (Figure 2A). Moreover, K442 from
Lβ4 forms an ionic bond with the side-chain atom of
E203 on Pβ1. It was noted that the TEV-cleavable linker
forms another antiparallel β-sheet with the N-terminal
Lβ1 of Leo1 ('TEV linker'-Lβ1 antiparallel sheet)
(Supplementary Figure S4).

We performed a series of mutagenesis studies to validate
the interactions observed in the structure of the
Paf1(161-250)-Leo1(370-462) complex. To probe the Pβ1-Lβ4
antiparallel interaction, we used a co-immunoprecipitation
(Co-IP) strategy with different tags on
the various truncations of Paf1 and on full-length Leo1. As
expected, Myc-Leo1 specifically co-precipitated with green
fluorescent protein (GFP)-Paf1, whereas negligible
amounts of Myc-Leo1 co-precipitated with any GFP-Paf1
mutants in which the Pβ1 strand was omitted
(Figure 2B and C), indicating that the Pβ1-Lβ4 antiparallel
sheet is vital for the Paf1/Leo1 subcomplex formation.

Assembly of human PAF1C

The biochemical and structural data above demonstrated
that the Paf1 and Leo1 subunits can specifically bind to
each other to form a heterodimer. We next investigated
the role of Paf1/Leo1 subcomplex formation
in the assembly of human PAF1C using a Co-IP
strategy. In the wild-type PAF1C, both Myc-Paf1 and
Myc-Leo1 specifically co-precipitated with the other PAF1C
subunits except Rtf1 (Figure 3A, lane 2, and Figure 3B,
lane 2). In the mutant PAF1C, which contained the
Myc-Paf1(ΔPβ1) mutant, Myc-Paf1(ΔPβ1) could also
co-precipitate with Ctr9, Cdc73 and Ski8, but not Leo1 or
Rtf1 (Figure 3A, lane 3). In contrast, Myc-Leo1 did not
co-precipitate with any proteins (Figure 3B, lane 3),
indicating that Leo1 binds to PAF1C through the Paf1
subunit. In the same experiments, when the Ctr9 subunit
was omitted, we found that Myc-Leo1 co-precipitated
only with Paf1 in the context of wild-type PAF1C
(Figure 3B, lane 4); Myc-Paf1(ΔPβ1) did not co-precipitate
with any proteins in the context of mutant PAF1C
(Figure 3A, lane 4). These data indicated that the Ctr9
subunit is a key scaffold protein binding to the Cdc73
and Ski8 subunits during PAF1C formation. Together,
these results led to the establishment of the interaction
network for the assembly of human PAF1C (Figure 3C).

Figure 2. The interaction between Paf1 and Leo1. (A) The interface details in the Paf1(161-250)-Leo1(370-462) heterodimer. The β1 strand of Paf1 (Pβ1)
is colored in cyan, and the β4 strand of Leo1 (Lβ4) is colored in magenta. Hydrogen bonds and ionic bonds are shown as dotted red lines. (B) Co-IP
experiments of the interaction between Leo1 and various Paf1 constructs. Extracts were prepared from HEK293T cells transfected with various
combinations of plasmids as indicated, immunoprecipitated with agarose-conjugated anti-GFP and subsequently immunoblotted with anti-Myc (top
panel) or anti-GFP (bottom panel) as indicated. The top panel shows the immunoprecipitation (IP) results. The middle panel represents 2% of the
input material Myc-Leo1 for each IP. The bottom panel represents the IP of GFP and various GFP fusion proteins (GFP-Paf1, GFP-Paf1(ΔPβ1),
GFP-Paf1(NT), GFP-Paf1(CT)). Bands indicating degradation products in GFP-Paf1, GFP-Paf1(ΔPβ1) and GFP-Paf1(CT) are marked by stars. (C)
Summary of results obtained in (B). '+' or '-' indicates binding or no binding, respectively, between Leo1 and Paf1 in the co-IP experiments.
A schematic representation of the full-length and mutated Paf1 constructs used in the IP experiments is shown.

Figure 3. Assembly of human PAF1C. (A and B) Co-IP experiments of PAF1C formation by Paf1 (A) and Leo1 (B). Extracts were prepared from
HEK293T cells transfected with various combinations of plasmids as indicated, immunoprecipitated with agarose-conjugated anti-Myc and subsequently
immunoblotted with anti-Myc (upper top panels) or anti-GFP (lower bottom panels) as indicated. The top panel shows the IP results. The
bottom panel shows 20% of the GFP fusion proteins for each IP. Nonspecific bands and the heavy chain of the antibody are marked by a star and
asterisk, respectively. (C) Model of PAF1C assembly. The bold line represents the interaction in the crystal structure of the Paf1(161-250)-Leo1(370-462)
heterodimer. The fine lines represent the interactions obtained from the IP results. The dotted line represents interactions that we could not detect in
the IP experiments.

The heterodimer of the Paf1/Leo1 subcomplex specifically
recognizes histone

Previous studies have demonstrated that PAF1C plays
an essential role in transcription-coupled histone
modifications (15,17-20,28,40-43) and that the Paf1
subunit may be involved in histone H3 binding (28,43).
We speculated that the Paf1/Leo1 heterodimer may bind
to histone protein. To test this hypothesis, the first 28
amino acids of the N-terminal histone H3 were fused to
the N-terminus of GST (H3(1-28)-GST). In a GST pull-down
assay, H3(1-28)-GST could bind to both the Paf1
and Leo1 subunits but not to any other subunits of
PAF1C (Figure 4A). Next, we mapped each histone H3
binding domain of Paf1 and Leo1 using GST pull-down
assays and found that the C-terminal region of Paf1 (aa
376-531) (Supplementary Figure S5A) and the N-terminal
region of Leo1 (aa 1-369) (Supplementary Figure S5B) are
essential for histone H3 binding. Importantly, we showed
that H3(1-28)-GST binds more strongly to the wild-type
Paf1/Leo1 heterodimer than to the Paf1(ΔPβ1)/Leo1
mutant or the Paf1(1-375)/Leo1 and Paf1/Leo1(370-666)
complexes (lane 2, compared with lanes 3, 4 and 5 in
Figure 4B; the results are summarized in Figure 4C).
The Paf1(ΔPβ1)/Leo1 mutant is incapable of heterodimer
formation (lane 3 in Figure 2B), and the Paf1(1-375)/Leo1
and the Paf1/Leo1(370-666) complexes contained only one
histone H3-binding site (cartoon in Figure 4C). These
findings indicate that the Paf1/Leo1 heterodimer containing
both histone H3-binding domains in Paf1 and Leo1 is
important for H3(1-28)-GST binding. Furthermore, we
introduced two point mutations, K4A and R17A, in
H3(1-28)-GST (referred to as H3(1-28)(K4A, R17A)-GST)
and found that the H3 mutant cannot bind to the
Paf1/Leo1 complex (Figure 4D). Together, these data
indicate that the Paf1/Leo1 subcomplex specifically
recognizes the N-terminus of histone H3.

The H3(1-28)-GST fusion protein contains two histone
H3 tails because the crystal structure of GST shows that
the protein forms a dimer (44). Similarly, the histone
octamer in the nucleosome also contains
two histone H3 units; thus, we hypothesized that PAF1C
may specifically bind the nucleosome through the Paf1/Leo1
heterodimer. As expected, the Paf1/Leo1
subcomplex interacted with the reconstituted histone
octamer in a GST pull-down assay (Supplementary
Figure S6A). Furthermore, we showed that the Paf1/Leo1
subcomplex binds specifically to both histone octamer
(Dot No. 1 in Figure 5A) and nucleosome (Dot No. 1 in
Figure 5C) in the dot-blot overlay assays, whereas the
Paf1/Leo1(370-666) and the Paf1(1-375)/Leo1 complexes
bind to histone octamer (Figure 5A and B) or nucleosome
(Figure 5C and D) with more than half the affinity of
the wild-type Paf1/Leo1 complex. Interestingly, the
Paf1(1-375)/Leo1(370-666) complex lacking both the
histone H3-binding domains of Paf1 and Leo1 binds
weakly to both histone octamer and nucleosome (Dot
No. 4 in Figure 5A and C), and thereby serves as a
negative control. Together, these results indicate that the
Paf1/Leo1 heterodimer may play an important role in
recognizing substrates during various PAF1C-coordinated
histone modifications.

Figure 5. Interaction of the Paf1/Leo1 subcomplex with histone octamer and nucleosome. (A and C) Dot-blot overlay assays of the Paf1/Leo1
complex and histone octamer and nucleosome. The bound histone octamer or nucleosome was detected by immunoblotting with antibody against
histone H3 (top panel). The PVDF membrane was stained with Coomassie blue (middle panel). Each recombinant Paf1/Leo1 complex was resolved



DISCUSSION 

The crystal structure of the Paf1/Leo1 subcomplex
resolved in this study provides structural information
regarding the interaction between the Paf1 and Leo1
subunits. A 90-residue fragment (aa 161-250) of human
Paf1 binds to a 93-residue fragment (aa 370-462) of
human Leo1 primarily through antiparallel sheet interactions
(Figures 1 and 2). The amino acid sequences of Paf1
and Leo1 are extremely highly conserved from
zebrafish to human, in particular in the
regions involved in the formation of the Paf1/Leo1
heterodimer, although not in fungi (Supplementary Figure
S7A and B). Therefore, it is reasonably safe to assume that the
structural and biochemical features of the Paf1/Leo1
subcomplex described here are shared by the complexes
formed by the orthologs of these two proteins in other
metazoan species.

Although the sequence identity between human
and yeast Paf1 and Leo1 is low, the predicted secondary
structures of yeast Paf1(150-225) and of the
corresponding yeast Leo1 fragment are similar to those of
human Paf1(161-250) and Leo1(370-462), respectively
(Supplementary Figure S7A and B). Consistent with this
observation, GFP-tagged yeast Paf1 (GFP-yPaf1)
co-precipitated with Myc-tagged yeast Leo1 (Myc-yLeo1),
whereas the yPaf1(Δ179-187) mutant (lacking residues
179-187 of the predicted β1'-strand in yeast Paf1)
co-precipitated negligible amounts of Myc-yLeo1 (lane 3
compared with lane 2 in Supplementary Figure S7C),
indicating that the yeast Paf1/Leo1 subcomplex may form
a heteromeric complex through antiparallel beta-sheet
interactions similar to those described here for the
human Paf1/Leo1 subcomplex.

In the crystal structure of the Paf1(161-250)-Leo1(370-462)
heterodimer, we clearly observed the electron density of the
adenine moiety and ribose moiety of AdoMet bound to two
symmetry-related molecules of the Paf1(161-250)/Leo1(370-462)
heterodimer (Supplementary Figure S3A-C).
The AdoMet forms hydrogen bonds with the main-chain
carboxyl group of K188 and A186 in Paf1 and E428 in
Leo1 to stabilize the corresponding loops of Paf1 (loop
between Pα1 and Pβ1) and Leo1 (loop between Lβ2 and
Lβ3), respectively. Not all of the residues involved in
AdoMet binding were conserved between fungi and
metazoa in both Paf1 and Leo1 (Supplementary Figure
S7A and B), and we could not detect any interaction
between the Paf1/Leo1 subcomplex and AdoMet in
solution using isothermal titration calorimetry
(Supplementary Figure S8). Additionally, during the
screening with additives, we found that the cyclic AMP
molecule, which has a structure similar to AdoMet, can
also facilitate crystal packing (Supplementary Figure
S3D). Taken together, these results indicate that the
introduction of AdoMet facilitates crystal packing rather than
performing any specific biological function.

PAF1C in yeast is composed of five subunits, including
Paf1, Leo1, Ctr9, Cdc73 and Rtf1 (5,6), and PAF1C in
human contains six subunits, including Ski8/Wdr61 (7).
However, the Co-IP results shown in Figure 3 indicate
that Rtf1 is a less stable subunit in human PAF1C,
consistent with several other studies (19,45,46). Furthermore,
the histone modification domain (HMD) in Rtf1 can
promote histone H3 methylation and H2B ubiquitylation,
independent of the other subunits in PAF1C (47). The
HMD lacks the region required for interaction with
other PAF1C subunits (11), indicating that Rtf1 alone
can accomplish some functions.

In our study, we found that both Paf1 and Leo1 can
bind to the histone H3 tail (Figure 4A), which is somewhat
inconsistent with two recent studies showing that PAF1C
interacts with the histone H3 tail only through the Paf1
subunit (28,43). It is possible that we used a longer H3 tail
(28 aa) than was used in the other studies. Xu and colleagues
showed that the asymmetrically dimethylated
histone H3 at R17 (H3R17me2a) binds to the Cdc73/Ctr9/Ski8
subunits of PAF1C (43) and that the unmodified
H3 tail binds to the Paf1 subunit. These data, together
with the results of this study, indicate that PAF1C may
use different subunits to recognize native and/or various
posttranslational modifications of histone in the context of
temporal and spatial regulation.

Previous studies have established a role for PAF1C in
H2B monoubiquitylation (15,16) and histone H3 methylation
(15,17-20,40,42,43). Histones H2B and H3 are the
substrates for the corresponding histone modification
enzymes. In this study, we showed that the heterodimer
of the Paf1/Leo1 subcomplex is necessary for specifically
binding to both histone octamer and nucleosome in vitro
(Figures 4 and 5). Taken together, these results indicate
that PAF1C may act as a platform or mediator to facilitate
the histone modification process. Future studies are
required to test this hypothesis.